This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@kwen2501 (Contributor) commented on Sep 28, 2024

Change 1

Changing from:

    with device:
        model = Transformer(config)

to

    with torch.device("meta"):
        model = Transformer(config)

because when we later load the weights, we swap out the model's tensors anyway, so this saves an on-device initialization.
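A minimal, self-contained sketch of why this works, using nn.Linear as a stand-in for Transformer(config): under the meta device, parameters carry only shapes and dtypes, and a later load_state_dict(assign=True) swaps the real tensors in outright.

    import torch
    import torch.nn as nn

    # Stand-in for Transformer(config); under the meta device, parameters
    # get shapes and dtypes but no real storage is allocated.
    with torch.device("meta"):
        model = nn.Linear(4096, 4096)
    assert model.weight.is_meta

    # Later, loading weights with assign=True replaces the storage-less
    # meta parameters with the loaded tensors directly (no copy).
    state_dict = {"weight": torch.randn(4096, 4096), "bias": torch.randn(4096)}
    model.load_state_dict(state_dict, assign=True)
    assert not model.weight.is_meta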

Change 2

Also added a with-device context around the model.setup_cache() call, so that the caches are created directly on the target device, saving a model.to(device) call.
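A hedged sketch of the pattern, with a toy setup_cache method (the cache shapes and method body here are illustrative, not torchchat's actual implementation):

    import torch
    import torch.nn as nn

    class ToyModel(nn.Module):
        # Illustrative stand-in for the real model's setup_cache.
        def setup_cache(self, max_seq_len: int, head_dim: int) -> None:
            # Tensor factory calls pick up the ambient device context,
            # so these caches land directly on the target device.
            self.k_cache = torch.zeros(max_seq_len, head_dim)
            self.v_cache = torch.zeros(max_seq_len, head_dim)

    model = ToyModel()
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    with device:  # torch.device works as a context manager in PyTorch 2.x
        model.setup_cache(max_seq_len=2048, head_dim=128)
    assert model.k_cache.device.type == device.type  # no model.to(device) needed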

@pytorch-bot bot commented on Sep 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1227

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit cb77ba5 with merge base 77bac00:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Sep 28, 2024
@kwen2501 kwen2501 requested a review from lessw2020 September 28, 2024 08:24
@metascroy (Contributor) commented

Rebase to fix the failing torchao_experimental check.

@lessw2020 (Contributor) left a comment


lgtm!
Tested with both llama2 and llama3.

@kwen2501 kwen2501 changed the base branch from pin_torch to main October 2, 2024 19:30
@kwen2501 kwen2501 merged commit 8fcb3ba into main Oct 2, 2024
52 checks passed
